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Abstract. This paper completes the statistical modeling 
of the Hubble flow when a Tully-Fisher type relation is 
used for estimating the absolute magnitude M ap + h 
from a line width distance indicator p. Our investigation 
is performed with the aim of providing us with a full un- 
derstanding of statistical biases due to selection effects in 
observation, regardless of peculiar velocities of galaxies. 
We show that unbiased iJo-statistics can be obtained by 
means of the maximum likelihood method as long as the 
statistical model can be deflned. We focus on the statis- 
tical models related to the Direct, resp. Inverse, Tully- 
Fisher relation, when selection effects on distance, resp. 
on p, are present. It turns out that the use of the Inverse 
relation should be preferred, according to robustness cri- 
teria. The formal results are ensured by simulations with 
samples which are randomly generated according to usual 
characteristics. 
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Introduction 



^ The statistical biases that affect the analysis of cosmic ve- 
locity flelds, when a Tully-Fisher (TF) relation is used in 
the distance estimate of galaxies (see Tully & Fisher 1977, 
Faber & Jackson 1976) is an important problem in obser- 
vational cosmology (see e.g., Teerikorpi 1975, Schechter 
1980). At present, among different approaches, there is 
a general agreement on the usual bias deflnition, which 
says that a statistic is biased if its expected value does 
not correspond to the model parameter for which it has 
been made up (see e.g., Teerikorpi 1994, Strauss & Willick 
1994). Such a statement shows clearly the necessity of 
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deflning a statistical model (e.g., for checking whether a 
statistic is biased and for calculating the related correc- 
tion, see e.g., Triay 1993, Willick 1994). With this in mind, 
we understand that unbiased statistics can be obtained 
as long as the probability density (pd) describing the data 
can be defined. Moreover, nothing prevents us to use solely 
the maximum likelihood (ml) technique, which provides us 
unambiguously with a unique fitting technique, and pre- 
vents us from subjective speculations on diagrams, Triay 
(1994). In the present fields of interest, the problem of 
biases is related to the question of whether the selection 
effects in observation are indeed described by the statisti- 
cal model. At first glance, there is however an additional 
difficulty, which is due to the not yet solved problem of 
the choice between the frequentist and the Bayesian ap- 
proaches, see Hendry et al. (1994). In this paper, we do not 
address this question and we limit ourselves to complete 
the Bayesian approach developed in Triay et al. (1994) 
investigation (herein TLR), which consists on the statis- 
tical modeling of the Hubble fiow, regardless of peculiar 
velocities of galaxies. The aim of such an investigation 
is to provide us with a full understanding of statistical 
biases due to selection effects in observation, when the 
Direct or the Inverse TF models (herein, DTF and ITF) 
are used (see e.g., Teerikorpi 1982,1984,1987, Bottinelli 
etal.l985,1986a,1986b,1988a,1988b, Tully 1988a,1988b). 
A consensus can be found by arguing on the statistical 
model, instead of the technique of fitting, which shows 
that the estimates of galaxies distances and Hq are not 
model dependent, contrarily to calibration parameters of 
the TF relation (see TLR, Rauzy 1994, Rauzy&Triay 
1995). A sensible choice of the model has to be moti- 
vated solely by reasons of robustness of statistics, which 
depends on selection effects in observation. Section 3 gives 
the (unbiased) iJo-statistic within the ITF-model when 
selection effects on the line width distance estimator are 
present, and Sect. 4 for selection effects on distance (or re- 
cession velocity) within the DTF-model (ITF-model with 



tion effects on fine width distance estimator are treated in 
TLR). In order to have a visuaf support for our theoreti- 
caf approach and to estimate the magnitude of biases, we 
perform numericai simuiations in Sec. 5. The mathemati- 
cai framework is specified in Sec. 2. Notations and useful 
formulas are given in Appendix A, these features are ad- 
dressed throughout the text by means of symbol "_De/," . 

2. The basic model 

The difference between the ITF or the DTF relations for 
obtaining the absolute magnitude of sources M ^ a p + h, 
from a log line width distance estimator p, interprets es- 
sentially within a framework of the statistical modeling of 
data. The first step of such a process is to enumerate the 
variables involved in the calculation, which are : - the ab- 
solute magnitude M , - the line width distance estimator 
p, - the distance modulus = 25 -|- 5 log r (r is the dis- 
tance from the observer in Mpc), - and a similarly defined 
quantity rj = 25 -|- 5 log t;, which accounts for the recession 
velocity v. If the peculiar velocities of sources are neglected 
then the Hubble law can be written as follows 



Pth(4') is the normalization factor, see (Def.3). For defin- 
ing the pd dPth, one assumes working hypotheses on the 
distribution of the intrinsic quantities M , jj, and p. If no 
luminosity evolutionary effect of sources is present then 
we can write 



dPth = F(M,p)dMdp K{ii)dii, 



(7) 



where accounts for the distribution of galaxies in 

space and F(M,p) for the M-p distribution. The differ- 
ence between the ITF and the DTF models lies on de- 
scribing this pdf, one has 



F(M,p)dMdp « ^0, g(C; 0, ac)dC, 



(8) 



where ^ depend on the choice of the model which "mimics" 
the TF-relation : 



M in the ITF model 
p in the DTF model. 



(9) 



T] = fi + Ti, 

where 

n = 5iogi?o. 

The apparent magnitude is given by, 
m = M + /J,. 



(1) 



(2) 



(3) 



Finally, for clearness in reading, we use the following vari- 
ables 



X = m — r], 

y = a.p + h + rj — m. 



(4) 
(5) 



The analysis of the Hubble fiow involves two steps : 

1. The calthratton of the TF-relatton M a.p -\- 
h. The statistics of model parameters a and 
h are written in terms of observable quantities 

2. The determmatton of the Hubble constant. The Hq- 
statistics are written in terms of observable quantities 

{xk, yk}k=l^N2- 

The second step is to write the pd describing the distribu- 
tion of above variables (according to working hypotheses) 
which characterizes the statistical model. Let us write 



dPo 



bs 



dPt 



th, 



(6) 



the parameter ^o, resp. a^, denotes the mean, resp. the 
standard deviation, and the random variable ( describes 
the scatter of the TF-relation, it has necessarily a zero 
mean, cr^ denotes the related standard deviation. We as- 
sume : 

•ho) a /zwear TF-relation, 

( = a.p + h- M, (10) 
•hi) a Gaussian ^-distribution 

9(C;0,tc) = 9g(C;0,tc)- (n) 

The description of the M-p distribution will be completed 
in the next sections. About the spatial distribution of 
sources, we assume that it is given by 

•/}2) a power law 

k(ij,) (X exp(/3//), (12) 

where /3 = 3 In 10/5 accounts for a uniform distribution. A 
part of selection effects is described in terms of apparent 
magnitude, (i.e., (f) cx (/)„) and we assume : 

•hs) a cutoff at a given limiting magnitude mam, 

ij>m(m) = 0(mri^ - m), (13) 

where 6 denotes the Heaveside distribution function. 

3. p-limited samples in the ITF model 

The ITF-model is specified by the luminosity distribution 
function, we assume that : 

•/}4) it is Gaussian 

/m(M;Mo,(Tm) = ffG(M;Mo,(TM). 
The pd describing the data reads 

,p <i>m{'m)<j>p(p) 



where (f) is a function written in terms of observable quan- 
tities which describes the selection effects in observation 
(it is called selection function), dPth is the pd describing 



Pth((t)m(t)p) 



(14) 



(15) 



where and dPth are given by Eq. (7-14), 4'pip) accounts 
for p-selection effects, and it works as : 



(j,p(p) = 0(p - Pmm)0{Pmax " p) ■ (16) 

The normalization factor Pth{4'm4'p)j see (Def.3), reads 



Pth((t)m(t)p) = J 0(p - Pmin)0(pmax - P)adp 

gcAM; M„,aM)9GiM; a.p + b, ac_)dM 
0{mim - M - fi) exp(j3fi)dfi, (17) 

provided the working hypotheses (/iq-Zis). The inner inte- 
gration over /J provides us with the term ^-le''™'™ e~^^ , 
and according to (Def. 2c), we have 

e-^^gGiM;Mo,aM) = e-^^^^-i^<^ 

xgG(M;M,-l3alf,aM). (18) 

Hence the integration over M gives 

gG(M;M,-l3alf,aM)gG(M;a.p+h,ac)dM (19) 
= gG(a.p+h;M,-l3alf,a^), (20) 



where 



see (Def. 2d). Finally, we obtain 

Pthi(f'm(f>p) = ApPthi(f>m), 

where 



1 



Pthi<l>m) = exp /3 ( miim - Mo + '--alj 



I 

2 



(21) 



(22) 



(23) 



does not depend on a, & and <tq, and 

Apia, h, (Tq) = Wp(pmax) - <^p{Pmm), 

with 

ujp(p) = Af(v(p)) , 
v{p) = ^{ap + b- Mo+ fia'ii) , 

where M denotes the erfc function, see (Def. Ic). 



(25) 
(26) 



3.1. Calthratton statistics 

According to TLR, if p-selection effects are not present in 
the observations then the model parameters a, h and <tq 
can be measured as follows 



JTF 



MM? 
Coy lip. My 



pI(p,m) 



(27) 
(28) 
(29) 



</'m, /m and K. In the other hand, if p-selection effects 
are present then these functions have to be specified for 
obtaining calibration statistics. The only reason is that 
the normalization factor depends on model parameters, 
through Ap(a, h, cr^) which reads in terms of /m and 
K, see Eq. (22). According to working hypotheses (ho-h^), 
the efficient part of the likelihood function (If), see (Def.A), 
is given by 

C\{a,b,ac) = -lnAp{a,b,ac) + C^;^^{a,b,ac), (30) 
where 



£r=lna-ln.c--^E 



1 (a.pk + b-Mk)' 



k = l 



2(tI 



(31) 



is the likelihood function when no p-selection effect is 
present. For convenience in notation, let us define the func- 
tions 



da A 

^a{a,b,a() = 



(32) 



with a = a,b, u'^. Hence, the ML equations are given by 



a{p{ap+b- M))i = al(l-a'da), 
{ap+b-M)i = -(T^2^j, 

{{ap + b-M)^)i = al(l + 2al^,2^ 



(33) 
(34) 

(35) 



see (Defb). According to Eq. (24-26), the generic form for 
these terms reads 



dAp{a, b, (tJ = (9Wp(pmax) - duip{p,nm) 

where 

P9n{v{p)) 



(24) da^P) 



diujp{p) 



gN {v{p)) 



I s v{p)g^{v{p)) 
dol^pip) = ^(^T)^' 



(36) 

(37) 
(38) 
(39) 



Equation (34) gives straightforwardly the &-statistic. 
The a-statistic is obtained by substituting b into 
Eq. (33,35), according to Eq. (34). By arranging the terms 
in descending order of magnitude, we obtain^ 



JTF 



} + 7i {a{{p)i^i-^a) + <Tl (^2_2^^,))},(40) 
where a^^^ is given in Eq. (27), and 



71 



MM)' 



(41) 



^ A trick, which prevents us from cumbersome calculation 
the following equality (ap {a{p — (p)i) — (M — (M)i)))i = 
{a (p - {p}^) (a (p - {p}^) - (M - {M}^))}^ 



depend on the Hubble constant Hq for every functions 



{{ap + h- MY)i = {ap(ap +h- M))i + h{{ap + & - M))i and (t)p\ One has 



p 

-{M(ap + h-M)),. (42) ^ 

7 = — ~7i, (51) 
Cm 

The first two terms are substituted according to 

Eq. (33,34). The third one reads aCov(p, M) - (M) - and 

(T?(M)i«?j. Hence, we obtain an expression in terms of cr? / ox,, . ^ 

A i 2 , ■ , \ , f ^ n'^ = {M,-(]al,)-(x)2. 52 

and (T^, and the (T^-statistic interprets as the root oi a sec- ^ / > / 

ond order polynomial. Finally, the ML statistics are given The i?o-statistic given in Eq. (50) is Gaussian distributed 
by about 7i with a standard deviation given by 

I _ „ITF ^ i + lM 1 

(.43) cr^^iTF = , ^ ^==, (53) 



fci = {M),-a\p),-{al)'^,, (44) gq. (45) 

(•^J)' = ^ (^l + 8^,.((aip + &i-M)^)i- l) , (45) 



4. //-limited samples in the DTF model 



, ,T,T J- i- 1 £ 11 The DTF-model is specified by the ^-distribution function, 

where the are lunctions dehned by j r j 

and we assume that : 

*l(a, h, a^) = al (^^ _ 2^^.) , (46) '^''^^ Gaussian 

,T,I^ /. ^ ITFm /\a^ /p Po , <Tp) = (/G (p; Po , <Tp) . (54) 

*2(a,&,(T(;) = a (^a-(p)i^i)- (47) ,1 -K- fl, ^ f ^ 

ihe pa describing the data reads 

In practice, the estimates of parameters aK, a^,h K,h^ and (f)^(^m)(f> [ji) 

(tJ are obtained from Eq. (43-45) by Newton itera- dPohs = ~~p~^J^ ^ -j c^-Pth, (55) 
tive method. The derivation of prf/ related to calibration 

statistics is cumbersome, and useless because of the small where and dPt\i are given by Eq. (7-10,13-11,54), (/)^(//) 
number of sources that are used in general. The statistical accounts for //-selection effects, and it works as : 
behavior for these estimates is investigated by means of .^^^ ^ ^-^^^^^ distribution function 
simulations, see TLR. 

<I>^,{H) = 0{n - H,nm)d{Hinax - 1^)- (56) 

Hence, the normalization factor Pth{4'm4'ti)j see (Def.3), 
within hypotheses (ho-hs,h'^,h6) reads 



3.2. Deiermmaiion of Hq 



The pd given in Eq. (6) reads in terms of observable quan- r 

titles X, y and rj, see Eq. (1,4,5), as follows Pth{<^>m<^>^i) = - /^min)^'(/^max - ^^) 

"^^"^^ " pI(.?^„.?^p) -^^'^'^'^° ~ '^^"^'^"^'^ 9Gip;Po,(^p)9Gia.p+b;M,a()dp. (57) 

xd) ~^ ^ ~ ^ )q {y-% (7 )dv f48) "^^^ inner integration over p provides us with the term 

" ' a ' ^' ^' ^ ' gG(M;a.p, + h,a^), see (Def.2h,d), where 

Let us emphasize that if p-selection effects are not present ^ 
then this pd reads as a product of two independent pds, y p C 

which provided us with a robust iJo-statistic 



cr" = ,/aV2 -^0-2. (58) 



n'^^* = {y)2, (49) 



Hence, the /i-M domain of integration can be separated 
into two sub-domains : the first one is defined by M < 



2 



It becomes obvious by writing 



see TLR. Thus this statistic becomes biased because of 
the extra term (fip i 
statistic defined by 



(t>m{x' + ii')4>p{ )fM{x'; Mc,o-m) 

XK(r]')dx'dr]' gG{y' ; 0, a()dy' , 



j'PP T~t^^^ -\- 7^?^'^ /rn\ where the dummy variables x' = x + H, y' = y — H and 



n^'' = , ' , (50) , ^ 

1-1- 72 y ^ 1^ = 1^ - H 



nilim-/Umax < M < miim-//min and //mm < /U < mUm-M. 

Once the integration over /j, is carried out, the one over M 
terminates the calculation. Finally, by using the following 
variables 

/"o = mim - (a-Po + b), (59) 
u,,{ij) = W(«2(//)) exp(«i (//))- W(«3(//)), (60) 



vi(//) = /3(T ( — ^ /3— 



^^2(/^) 



IJ-- IJ-o 



D 



= 5 , 



(61) 

(62) 
(63) 



see (_De/,lb,c), the normalization factor can be written 

Pthi4>m4>^.)=A^Pthi4>m), (64) 

where 

Ptb{^^) = ^ exp/3 (^iJo + ^ (aV2 + af)^ , (65) 



and 

A^,(a, h, a() = W^(//max) - W^(//min)- 



Calthratton statistics 



(66) 



According to TLR, if //-selection effects are not present in 
the observations then the model parameters a, h and <tq 
can be estimated by the following statistics 

DTF Coy lip, M) 



(67) 

(68) 
(69) 



^DTF ^ Ei(M)^l-pf(p,M), 

provided hypotheses (ho-hs,h'^). When //-selection effects 
are present then Aft intervenes in the normalization factor, 
see Eq. (64). Since, both Pthi't'm) and Aft depend on a and 
h, the efficient part of the likelihood function reads 

£?(a, h, ac) = - In A^(a, h, a^) + CY^^(a, h, a^), (70) 

where 



1 ^{a.pk+b- Mkf 



DTF 



#1 ^ 

k = l 



2al 



1 x-^iPk- Po) 



^1 



(71) 



is the likelihood function when no //-selection effect is 
present. Hence, the ML equations are given by 

{p(ap + h - M))i = {/3(p„ - paa^ - } , (72) 

{ap + h-M)i = a^(p-gi), (73) 

{{ap+h - M f)i = a} (l + alip^ + 2^,= )) . (74) 



{{p-Po))i = (^liPa + Qp^), (75) 
{{p-Pof)i = al{l + al[a'(]'' -2e..)}, (76) 



where (for convenience in notation) the functions 



Qa (a, h, a() 



3 A 

An 



(77) 



According to Eq. (59-63,66), these last terms are calcu- 
lated as follows. One has 

dA^,(a, h, a() = (9w^(//max) - ^^^(/"min) (78) 

where 

5w^(//) = ((9v2fi'N (^2) + dviAf (v'j)) exp (vO-SvaffN (^3) ,(79) 
with 

davii^i) = l3{po-l3aal) , (80) 

(81) 



Q / X 1 / (// - //q) acTp 

daVsil^) = '9aW2(//) + /3(Tp^, 

(T 

dbviiji) = l3, 

dhV2{ji) = — ^, 
(T 

5^=-2(/.) = 2(,D)2 ,D - 

dp^Viifi) = adiVi(fi), 



(82) 
(83) 
(84) 
(85) 

(86) 
(87) 

(88) 

(89) 
(90) 



where i = 1,2,3 for Eq. (89,90). Finally, the ML equations 
yield the following statistics : 



„E) _ „DTF 

CI — Uj X .-1 T-^ , , 

l-7g*?(a,&,<Tc)' 
fcD = ^M)i-a''{p)i + {af)'(p-g^) 
,2 _ _1 

" "1-3 

and (if necessary) 



(91) 
(92) 



K) = ^(Vl + 2*?((«>+&°-M)^)i-l)(93) 



= (P)i+«°('T°) (/3-eO, (94) 



Kf = ^ {^Jl + 2^f{(p-p.)\- 1) 



(95) 



defined by 



7d 



V-Covi(p,M)' 
provided Covi(p, M) < 0, and tlie functions read 

*?(a,&,crc) = {Qa - {p)lQb) 



(96) 



(97) 
(98) 

(99) 



Similarly as previous section, the estimates of parameters 
a ^ , b ^ and cr^ are obtained from Eq. (91-95) 

by Newton iterative method. 

4.2. Deiermmaiion of Hq 

The terminology "distance limited sample" is meaningless 
in this step, because /j, is not an observable, and thus one 
has to account for selection effects on recession velocities. 
Let us assume that the ry-selection effects are described 
by: 

•hr) a window distribution function 

ij>r,(r]) = 0{V - Vmm)0{Vmax - V)- (100) 

The pd, given in Eq. (6), reads in terms of observable quan- 
tities X, y and rj, see Eq. (1,4,5), as follows 

,p _ <l>m{x + v)<l>,j{v) I , x + y-b ^ 

"-fobs — n / I I \ Jp\ tPoj^p) 



K(ri — 7i)dxdri x gciy', T~i, (T()dy. 



(101) 



where fp and k are given by Eq (13,54,12). The cal- 
culation of the normalization factor Pth(4'm4'v) obvious 
by using the results of previous section. 



Pthi<l>m<l>,^) = A,^Pthi<l>m), 

where Pthi't'm) is given by Eq. (65), and 
Ar,(Ti) = L0ij.{ri^2.x -H) - oji_,(r]^in - Ti). 
Therefore, according to TLR, the //reads 

^2 



(102) 



£pTF(^) = -lnA--f E 

^ k = l 

which shows that the statistic 
must be corrected as follows 



2al 



(105) 



(106) 



term depends on %, which forces us to obtain the %- 
estimate by iterative method. Let us mention that we have 



duA^ = 9jw^(rymin - Ti) - (9jw^(ryn 



(107) 



see Eq. (79,83-85). According to TLR, the statistic given in 
Eq. (106) is Gaussian distributed about 7i with a standard 
deviation 



'C 



(108) 



VN2 

see Eq. (69). 

5. Magnitude of biases Simulations 

In this section, we investigate the biases (i.e., correction 
terms) of statistics defined in TLR when p-selection ef- 
fects, resp. //-selection or ry-selection effects, are present. 
Namely, for the calibration step, let us define 

_ jDTF 



6a^ 



,ITF 
„DTF 



6b^ 

6b^ 



(109) 



see Eq. (27,28,43,44,67,67,91,92) and the biases of Ho- 
statistics : STi.^ is the bias of the ITF statistic given in 
Eq. (50), when a^^^ and are used instead of and b^; 
STi.^ is the bias of the DTE statistic given in Eq. (105). In 
order to have a visual support for our theoretical approach 
and to estimate the magnitude of biases we use numerical 
simulations. The random samples are performed according 
to previous hypothesis. We assume a Hubble's constant of 
Ho = 100 Mpc/km.s~^, a cut-off at apparent magnitude 
of miim = 12, and additional selection effects that depend 
on the TF model. The sample-size is chosen sufficiently 
large for minimizing the effect of statistical fiuctuations 
in the analysis of bias. 

It turns out that these simulations validate the statis- 
tics defined in previous sections, and in particular the effi- 
ciency of the Newton iterative method, since we obtained 
the same values of TF parameters and of Hq that were 
assumed for the simulations. 



(103) 5.1. Simulation of p-limiied samples m the ITF model 

For generating p-limited samples, we assume a Gaussian 
luminosity distribution with Mq = — 19 and <tm = 1-5; a 
TF-diagram defined by a slope of a^^^ = —6 and a zero 



-Pn, (104) point b 



ITF 



-7, with a scatter of a 



ITF 

C 



0.5; the p- 



selection effects described by a window distribution func- 
tion with Pmin = 2 and Pmax = 3. 

The magnitude of biases read 



6b' 



^2^ITF ^1 



1 + 7' 



1 + 7i'tf*2' 
ba'T.lip) - A (.tJ) 



-U{p)i-i},al 



(110) 

(111) 
(112) 



(113) 



It is interesting to note in Eq. (112) that the bias of the 
ITF statistics is canceled if the Cp-cntena (i.e., Cp = 0) is 
fulfilled (see TLR) when the calibration ITF statistics pro- 
vide us with biased estimates. The simulations provided 
us with the following numerical values 



(5a^f«0.21, f« 0.45, bTi^P^QM, 



(114) 



which shows that the bias in estimating the Hubble con- 
stant is ~ 0.5 percent. 

5.2. Simulation of jj, -limited samples m the DTF model 

For //-limited samples, we assume a Gaussian p- 
distribution with po = 2.5 and <Tp = 0.25; a TF-diagram 
defined by a^TF ^ jDTF ^ _^ ^dtf ^ o.5; 

for the calibration sample the //-selection effects described 
by a window distribution function with /«min = 30 and 
yU^jjj, = 32; for the sample used for estimating Hq, the 
ry-selection described by a window distribution function 
with rymm = 40 and rymax = 42. 
The magnitude of biases reads 



^flD = 2 ^DTF ^l +^2 

° 1-7^*? 



(115) 



8lP = -8a^{p)i + fiba"^ {2a'^ - ba"^) - q, {a'^f ,(116) 
8n^ = Cp8a°^i{p) + pba"^ {2(7"^ - 8(7^) - Qi K)^U7) 

where Cp is given in Eq. (113). The simulations provided 
us with the following numerical values 



(5a°f«0.15, (5&°f«0.01, Sn^^O.U, 



(118) 



which shows that the bias in estimating the Hubble con- 
stant is ~ 14 percent. 

6. Conclusion 

This statistical model of the Hubble fiow, when a Tully- 
Fisher (TF) type relation is used for estimating the ab- 
solute magnitude M ^ a p + h from a line width distance 
indicator p, is in agreement and completes the previous 
results obtained in Triay et al. (1994) (TLR). Namely, 
the "Direct" and the "Inverse" TF methods identify to 
maximum likelihood approaches corresponding to differ- 
ent models of the TF diagram. Hence, coherent estimates 
of model independent parameters, such as Hq and the 
galaxies distances, are obtained from (model dependent) 
unbiased statistics as long as the same model is used in 
the calibration step. The choice of the model should be 
motivated solely by criteria of accuracy and robustness of 



robust the statistics), which depends on selection effects 
in observation. For the calibration of the TF relation, if 
p-selection effects are not present then the most attractive 
approach for the determination of Hq corresponds to the 
ITF model, because not information on the data sample 
is required (about the spatial distribution of sources, the 
luminosity distribution function and the sample complete- 
ness on apparent luminosity and distance) for obtaining 
the calibration statistics. Otherwise, in addition of these 
characteristics, one has to be able to describe all the se- 
lection effects by specifying the form of the corresponding 
selection functions. In the DTF model, the data charac- 
teristics must be wholly specified for the derivation of un- 
biased calibration statistics. 

The present investigation provides us with unbiased 
statistics for the ITF model with p-selection effects and for 
the DTF model with selection effects on distance. While 
the calculation for obtaining unbiased statistics within 
both models which accounts simultaneously for these two 
selection effects is straightforward, it is not performed in 
our analysis. The formulas are derived by assuming usual 
working hypotheses : a Gaussian luminosity distribution 
function, a power law spatial distribution of sources, com- 
pleteness up to a limiting apparent magnitude and win- 
dow selection functions for describing the p-selection ef- 
fects and selection effects on distance. These statistics has 
been checked successfully by numerical simulations by us- 
ing random samples with usual characteristics. The bi- 
ases due to these selection effects when using the previ- 
ous statistics (given in TLR) are investigated, and their 
expected magnitudes are provided by the simulations. It 
turns out that the bias in estimating the Hubble constant 
within the ITF model can be removed by using the Cp- 
criteria. It is interesting to note that this bias is much 
weaker (0.5 percent) in the ITF model than the one (14 
percent) in the DTF model, while the order of magni- 
tude of biases in the calibration parameters are larger than 
those in the DTF model. 

Finally, let us emphasize that such a statistical frame- 
work can be used for obtaining likely distance estimates 
of galaxies in the usual sense'*, see Triay (1993). 
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A. Notations and useful formulas 

The following features are addressed throughout the text by 
using the symbol "_De/,". 
Def.l The probability density (pd) of a random variable x reads 
dP{x) = f{x)dx, where f{x) represents the pd function 



It is interesting to mention that these characteristics are 
almost antagonistic. 

* While it is evident that a more sensible approach requires 
a statistical modeling of the underlying dynamics of cosmic 
velocity fields. 



exhibit the model parameters involved in the statistical 
model, as the mean xo and the standard deviation a, by 
writing f{x;xo,cr). 

(a) gG(x;xo,o') = ((71/277)""' exp — ((x — xo)^/(2(7^)) is a 
Gaussian pdf. 

(b) g-Rix) = gG{x;0,l) is a Normal pdf. 

(c) Af{x) = g'N{t)dt is the cumulative Normal pdf. 
Def.2 Let / be a pdf and A be a scalar value, in most of calcula- 
tions, we use the following properties : 

(a) f{x + A; Xo, cr) = f{x; xo — A, a); 

(b) f{Xx;xo,a) = X-'f{x;f,f ); 

(c) exTp{\x)gG{x;xo, a) = exp (^\{xo + \^}^ gaix; xo + 
A(7^(7). 

(d) gajx; xi,ai)g G{x; X2, (J2) = gaix; xo, (Jo)gG{xi; X2, &), 
where a = erf + (t| , xo and ao are defined as follows 

= a^^ + (tJ^ and xocr^^ = xia^^ + X2(tJ^. 
Def.3 P{h) = J h{x)dP{x) denotes the expected value of the 
function h(x). 

Def.A The pd of a sample data {Qk}j.-^ j^, which consists of N 
independently selected objects Qk = {xk,yk,...), is given 
by YlLi dP{Qu)- 

(a) Its pdf, written in terms of observable quantities 
Xk,yk,. . . , but regarded as a function of model param- 
eters, provides us with the likelihood function. 

(b) ( The ML method.) The statistics of model parameters 
are obtained by maximizing the likelihood function, or 
(equivalently) its natural logarithm. Actually, the terms 
which do not contribute to the determination of param- 
eters are removed from it. Such a quantity is called the 
efficient part of likelihood function, herein simply de- 
noted If. 

Def.5 We have the usual definitions : 

(a) (x) = Xk/N is the sample average, 

(b) Coy{x,y) = Ef=i(^* - {^))iyk - {y})/iN - 1) is the 
covariance, 

(c) E(x) = Cov(x, x) is the standard deviation, 

(d) p{x,y) = Cov(x, j/)/(E(x)E(j/)) is the correlation coef- 
ficient. 

(e) These symbols are written as follows Ns, {.}s, CoVs(., .), 
Es(.) and Ps{., .) in order to distinguish between the 
calibration step (s = 1), and the _ffo-determination (s = 
2). 

Def.6 The accuracy of an estimator is formally defined as the 
reciprocal of its variance (The smaller the dispersion, the 
greater the precision.). 



Velocity Fields", 9th lAP Astrophysics Meeting, eds. F.R. 

Bouchet & M. Lachieze-Rey (Editions Frontieres). 
Rauzy S. 1994, in "The World of Galaxies II" (Lyon) eds. G. 

Paturel, Astrophys. Letters&Communications (Gordon and 

Breach), in press. 
Rauzy S. and Triay R., submitted to A&A. 
Schechter P.L. 1980, A.J. 85,801 

Strauss M.A., Willick J. A. 1994, Physics Report, in press. 

Teerikorpi P., 1975, A&A 45,117 

Teerikorpi P., 1982, A&A 109, 314 

Teerikorpi P., 1984, A&A 141, 407 

Teerikorpi P., 1987, A&A 173, 39 

Teerikorpi P., 1990, A&A 234, 1 

Teerikorpi P. 1994, in "The World of Galaxies II" (Lyon) eds. 

G. Paturel, Astrophys. Letters&Communications (Gordon 

and Breach), in press. 
Triay R., Lachieze-Rey M., Rauzy S. 1994, A&A 289,19 
Triay R. 1993, in "Cosmic Velocity Fields", 9th lAP Astro- 
physics Meeting, eds. F.R. Bouchet & M. Lachieze-Rey 

(Editions Frontieres). 
Triay R. 1994, in "The World of Galaxies II" (Lyon) eds. G. 

Paturel, Astrophys. Letters&Communications (Gordon and 

Breach), in press. 
TuUy, R.B. 1988, in "The Extragalactic Distance Scale", ASP 

ConL Ser. 4, 318 
TuUy, R.B. 1988, Nature 334, 209 
TuUy, R.B., Fisher, J.R. 1977, A. A. 54,661 
Willick, J. A. 1994, ApJS 92,1. 



References 

Bottinelli L., Gouguenheim L., Paturel G., de Vaucouleurs G. 

1985, A&AS 59,43 
Bottinelli L., Gouguenheim L., Paturel G., Teerikorpi P., 

1986a, A&A 156,157 
Bottinelli L., Gouguenheim L., Paturel G., Teerikorpi P. 1986b, 

A&A 156, 393 

Bottinelli L., Gouguenheim, L., Teerikorpi P. 1988a, A&A 196, 
17 

Bottinelli L., Gouguenheim L., Paturel G., Teerikorpi P. 1988b, 
ApJ 328, 4 

Faber, S.M., Jackson, R. 1976, ApJ 204,668. 



This article was processed by the author using Springer- Verlag 
Wr^X A&A style file L-AA version 3. 



